智能论文笔记

Experimental Study on the Imitation of the Human Head-and-Eye Pose Using the 3-DOF Agile Eye Parallel Robot with ROS and Mediapipe Framework

Amirmohammad Radmehr , Milad Asgari , Mehdi Tale Masouleh

分类：机器人

2021-10-31

在本文中，提出了一种模拟人脸和眼睛的方法，其可以被视为计算机视觉技术和神经网络概念的组合。从机械角度来看，使用3-DOF球形并联机器人，其模仿人头运动。在涉及眼球运动的顾虑中，将2-DOF机构连接到3-DOF球形平行机构的端部执行器。为了对模仿具有稳健和可靠的结果，应从面啮合物中提取有意义的信息，以获得面部的姿势，即卷，偏航和俯仰角。为此，提出了两种方法，其中每个方法都有自己的利弊。第一种方法在于借助Google引入的所谓的MediaPipe库，该库是用于高保真身体姿势跟踪的机器学习解决方案。作为第二种方法，模型是由不同姿势的面部图像的聚集数据集进行线性回归模型训练。另外，利用了三维敏捷眼睛并联机器人来示出该机器人用作类似于用于执行3-DOF旋转运动模式的人头的系统的能力。此外，制造3D印刷面和2-DOF眼睛机构以显示整个系统的方式更时尚。基于ROS平台完成的实验测试，证明了追踪人体头部运动的提出方法的有效性。

translated by 谷歌翻译

In classic reinforcement learning algorithms, agents make decisions at discrete and fixed time intervals. The physical duration between one decision and the next becomes a critical hyperparameter. When this duration is too short, the agent needs to make many decisions to achieve its goal, aggravating the problem's difficulty. But when this duration is too long, the agent becomes incapable of controlling the system. Physical systems, however, do not need a constant control frequency. For learning agents, it is desirable to operate with low frequency when possible and high frequency when necessary. We propose a framework called Continuous-Time Continuous-Options (CTCO), where the agent chooses options as sub-policies of variable durations. Such options are time-continuous and can interact with the system at any desired frequency providing a smooth change of actions. The empirical analysis shows that our algorithm is competitive w.r.t. other time-abstraction techniques, such as classic option learning and action repetition, and practically overcomes the difficult choice of the decision frequency.

translated by 谷歌翻译

神经网络无处不在用于教育的应用机器学习。他们在预测性能方面的普遍成功伴随着严重的弱点，缺乏决策的解释性，尤其是在以人为中心的领域中。我们实施了五种最先进的方法，用于解释黑盒机器学习模型（Lime，PermiputationShap，kernelshap，dice，CEM），并检查每种方法的优势在学生绩效预测的下游任务上，用于五个大规模开放的在线在线公开培训班。我们的实验表明，解释者的家属在与同一代表学生集的同一双向LSTM模型中相互重要性不同意。我们使用主成分分析，詹森 - 香农距离以及Spearman的等级相关性，以跨方法和课程进行定量的盘问解释。此外，我们验证了基于课程的先决条件之间的解释器表现。我们的结果得出的结论是，解释器的选择是一个重要的决定，实际上对预测结果的解释至关重要，甚至比模型的课程更重要。源代码和模型在http://github.com/epfl-ml4ed/evaluating-explainers上发布。

translated by 谷歌翻译